Single-Case Experimental Designs 


Uses in Applied Clinical Research 
David H. Barlow, PhD, Michel Hersen, PhD, Jackson, Miss


The difficulty of developing and evaluating effective treatments in
psychiatry and clinical psychology points out the inadequacy of
current experimental methodology involving comparisons of large
groups. An approach firmly founded in the scientific method, but
particularly appropriate to the study of complex behavior disorders,
is the single-case experimental design. In this paper, examples of
different single-case designs actually employed in applied clinical
research are presented and discussed. Practical problems arising
during the course of research are highlighted and some basic
procedures outlined. General questions on variability,
representativeness of findings, and clinical versus statistical
significance are briefly discussed.


A major stumbling block to the development and evaluation of
therapeutic techniques in psychiatry and clinical psychology has been
the difficulty in executing meaningful clinical research. The
traditional experimental group, control group, statistical analysis
method of doing outcome research has not lent itself readily to
psychiatry. Matching large groups of patients with similar
symptomatology is often impossible, even if one can afford the costs
of gathering data, following subjects, paying experimental therapists,
and analyzing the data. Ethical considerations of withholding
treatment from control group patients, even if they eventually receive
treatment, have also, rightly or wrongly, inhibited serious research.


An alternative approach, particularly appropriate for applied
clinical research, is the single-case experimental design. Derived
from the case study method of psychoanalysis on the one hand, and the
laboratories of experimental and applied psychology on the other, this
approach was probably first applied to clinical problems by Shapiro.
Since then, theoretical and logical aspects of this research strategy
have been discussed by Sidman, Dukes, Chassan, Baer et al, and
Davidson and Costello. Research articles employing the strategy are
increasing in various psychiatric and psychological journals.

Accepted for publication Jan 8, 1973.
From the University of Mississippi Medical Center (Drs. Barlow and
Hersen), and the Veterans Administration Center (Dr. Hersen), Jackson,
Miss.
Reprint requests to Department of Psychiatry, University of Mississippi
Medical Center, Jackson 39216 (Dr. Barlow).

Arch Gen Psychiatry/Vol 29, Sept 1973

In addition to the economic and ethical issues noted 
above, the single-case design has several major advan- 
tages for applied clinical research whether the variables 
under study are psychotropic drugs or interpersonal pro- 
cesses. 

Generality of Findings.—The first advantage is concerned 
with generality of findings. If an experimental group of 
50 patients does statistically better than a control group 
of 50 patients, such differences could be due to a small 
number of patients in the experimental group showing 
larger changes while the majority of the patients show no 
changes or perhaps deteriorate slightly. These individual
variations are masked in the group average. Furthermore,
as Chassan points out, the group-statistical design does
not permit conclusions as to particular patient character- 
istics correlated with improvement or deterioration. These 
data also are lost in the statistical analysis. Such findings, 
therefore, are not readily translatable to the practicing 
clinician. In the single-case design, where each patient 
serves as his own control in a separate experiment, effec- 
tive treatments can be linked with specific patient charac- 
teristics that are immediately relevant to the clinician. 
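
The masking of individual variation by a group average can be made concrete with a small sketch. The change scores below are hypothetical and are not data from any study cited here; they only illustrate how a favorable mean difference may rest on a handful of responders.

```python
from statistics import mean

# Hypothetical change scores (positive = improvement); illustration only.
control = [0.0] * 50
# Assumed pattern: 5 strong responders, 30 unchanged, 15 slightly worse.
treated = [20.0] * 5 + [0.0] * 30 + [-0.5] * 15

group_difference = mean(treated) - mean(control)
improved = sum(1 for change in treated if change > 0)
unchanged_or_worse = len(treated) - improved

print(f"mean difference favoring treatment: {group_difference:.2f}")
print(f"patients who improved: {improved} of {len(treated)}")
print(f"patients unchanged or worse: {unchanged_or_worse}")
```

Inspecting each patient's record separately, as a single-case design does, shows at once that the average describes almost none of the individuals in the treated group.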

Clinical vs Statistical Change.—In a group design a treat- 
ment “works” if it produces a statistically greater effect 
than a control procedure. While this type of finding is very 
important in basic research for theoretical reasons, in clinical
research these changes in patients may be so small as
to be statistically significant but clinically useless. In 
single-case designs the size of the behavioral change in 
given patients is easily observed, facilitating judgments 
on clinical utility. Although data from single-case designs
can be analyzed statistically if results are weak or unclear,
in practice this is seldom necessary.
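
The distinction between statistical and clinical change can be illustrated with a rough sketch. Everything here is an assumption made for illustration: the scores are invented, the clinical threshold of 5 points is arbitrary, and Welch's t statistic with a cutoff of 1.96 merely stands in for whatever test a group study might use.

```python
from math import sqrt
from statistics import mean, variance

def welch_t(a, b):
    """Welch's t statistic for two independent samples."""
    standard_error = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / standard_error

# Hypothetical improvement scores for two large groups (larger is better).
control = [10.0 + 0.1 * ((i % 5) - 2) for i in range(500)]
treated = [score + 0.05 for score in control]

CLINICAL_THRESHOLD = 5.0  # assumed smallest change a clinician would act on

t_value = welch_t(treated, control)
difference = mean(treated) - mean(control)
statistically_reliable = abs(t_value) > 1.96  # rough large-sample 5% cutoff
clinically_useful = abs(difference) >= CLINICAL_THRESHOLD

print(f"t = {t_value:.2f}, mean difference = {difference:.2f}")
print(f"statistically reliable: {statistically_reliable}")
print(f"clinically useful: {clinically_useful}")
```

With groups this large, a difference of a twentieth of a point passes the statistical cutoff while remaining far below the assumed clinical threshold.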

Mechanisms of Therapeutic Change.—Group experimen- 
tal designs often test vaguely defined global treatments 
against no treatment, particularly in psychotherapy studies.
This strategy obviates an analysis of specific mechanisms
of change or “active ingredients” in a therapy which
can then be combined with other active ingredients and 
used effectively by clinicians on specific patients. Single- 
case designs are particularly well suited for teasing out 
therapeutically active ingredients in a composite treat- 
ment variable. 

Variability.—Finally, in group designs the effectiveness 
of a given treatment is usually assessed just once after 
treatment is completed. This strategy precludes an exper- 
imental analysis of the patient’s course during treatment 
which, as every clinician knows, may vary considerably 
from day to day. A single-case design, where measures are 
continually taken, allows the clinical researcher to observe 
variability and to hypothesize which correlated environ- 
mental or personality variables may be active. New hy- 
potheses may then be subjected to an immediate experi- 
mental analysis. 

The purpose of the present paper, then, is to review 
single-case designs that have been employed in clinical re- 
search while providing examples of their use. Methodolog- 
ical and practical problems that arise during the execution 
of such research will be considered. Moreover, typical solu- 
tions to these problems will be outlined. 


General Considerations in Single-Case Designs 


A large variety of simple and complex experimental 
single-case designs have been developed by clinical re- 
searchers examining the effects of an equally large vari- 
ety of therapeutic variables in decreasing psychopathol- 
ogy. Despite the differences among designs there are some 
common features in each. As in group designs, target be- 
haviors (whether motor, physiological, or attitudinal) are 
clearly specified and methods of measurement are pre- 
cisely defined. Unlike group designs, however, continuous 
measurements are taken throughout each phase of the 
study. In most designs there is an initial period of obser- 
vation (baseline phase) in which the natural frequency of 
occurrence of the specific behavior is obtained (eg, emis- 
sion of tics per unit time in a ticqueur, or scores on a de- 
pressive scale for several days prior to therapeutic inter- 
vention). Baseline measurements are generally continued 
until a stable pattern emerges. A minimum of three sepa- 
rate observation points, plotted on the graph, during this 
baseline phase are required to establish a trend in the 
data. The most desirable trend is a steady rate of behavior 
(eg, number of tics per minute) so that the effects of treat- 
ment, either beneficial, detrimental, or no effect, will be 
clear. A second trend in baseline which is common and ac- 
ceptable in clinical research is one in which the patient is 
getting worse, a process that may have been going on for 
some time. With a deteriorating baseline, beneficial effects
of treatment (as well as no effects) are clear. Detrimental
effects, however, are less likely to be identified
when behavior in baseline is already deteriorating. For 
the sake of convenience, the baseline phase of all designs 
will be labeled as the A phase throughout this paper. 

In most designs, a therapeutic variable or treatment is 
introduced following establishment of baseline trends. In- 
troduction of the therapeutic variable, then, represents 
the B phase of the experiment. Here too, a minimum of 
three separate observation points, and often more, are required
to determine if the treatment is effective or not
and whether the effect is beneficial or detrimental. In
single-case, as in group, design it is most desirable to employ
blind evaluation of results. In single-case designs
this means that therapists are not aware of the data during
the experimental phases and those collecting the data
do not know which phase of treatment the patient is undergoing.
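
The baseline requirements described above (at least three observation points, and a judgment as to whether the trend is steady or changing) can be sketched as a small helper. This is only one possible operationalization: the least-squares slope and the tolerance of 0.1 units per observation are assumptions of this sketch, not criteria given in the text, and whether a rising trend means deterioration depends on the behavior measured.

```python
def least_squares_slope(values):
    """Slope of the best-fitting straight line through equally spaced observations."""
    n = len(values)
    if n < 3:
        raise ValueError("at least three observation points are needed to establish a trend")
    xs = range(n)
    x_mean = sum(xs) / n
    y_mean = sum(values) / n
    numerator = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, values))
    denominator = sum((x - x_mean) ** 2 for x in xs)
    return numerator / denominator

def describe_baseline(values, tolerance=0.1):
    """Label a baseline as 'stable', 'rising', or 'falling' (tolerance is an assumed cutoff)."""
    slope = least_squares_slope(values)
    if abs(slope) <= tolerance:
        return "stable"
    return "rising" if slope > 0 else "falling"

# Hypothetical tics per minute over five baseline sessions.
print(describe_baseline([12, 11, 12, 11, 12]))
# Hypothetical symptom counts that worsen across four sessions.
print(describe_baseline([4, 6, 9, 11]))
```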


A-B Design 


The A-B experimental single-case design, briefly alluded to
above, is the most basic of all designs. It represents a definite
improvement over the uncontrolled case study or case history in that
the target behavior is measured, and the effect of the introduction
of a therapeutic variable can be determined by comparison with
measured baseline rates of the behavior. However, the A-B design
must be classified as a correlational design inasmuch as mere
institution of the B phase does not permit unequivocal conclusions
as to the controlling effects of that therapeutic variable. More
specifically, changes brought about as a consequence of introducing
the B variable may possibly result from its correlates rather than
from its controlling effects.

An excellent example of the simple A-B design is presented by
Leitenberg et al. They examined the effects of selective positive
reinforcement on caloric intake and weight in an anorexia nervosa
patient. During baseline (A phase) the patient was given four
meals daily, each consisting of 1,000 calories. The patient was
allowed 30 minutes per meal and was instructed that eating in a
structured situation (a special hospital room) would lead to
improvement. The aforementioned conditions were maintained during
the B phase, but reinforcement was added. Reinforcement
consisted of social praise contingent upon increasing consumption
of food. In addition, privileges were made contingent upon
increased consumption. Examination of Fig 1 reveals that weight
maintained relative stability while caloric intake decreased
slightly throughout the 50-day baseline phase (A). Institution of
reinforcement (B) resulted in a marked linear increase of both
caloric intake and weight over the 50-day period. Although these data
suggest effectiveness of the reinforcement, its correlates (eg,
attention-placebo, expectancy, time, etc) could equally account for
the change. Only by removing reinforcement (a return to A), while
holding other correlational therapeutic variables constant and
noting decrement in target responses (calories, weight), is it
possible to claim unequivocally that the reinforcement technique was
the sole responsible agent of change. The latter design is known as
the A-B-A experimental single-case design.


A-B-A Design 


Hersen et al used an A-B-A design in assessing the effects of a
general work token economy on neurotic depression. Points
earned and behavioral ratings of depression (high ratings indicating
low depression) were the two target behaviors under study.
Examination of Fig 2 for one neurotic depressive reveals relatively
stable measurements in baseline, with a slightly upward trend for
points earned and a slightly downward trend for behavioral ratings.
Institution of token economy in phase B led to an upward trend in
points earned and behavioral ratings. Removal of token economy and
a return to baseline in the second A phase resulted in marked
decreases in both points earned and behavioral ratings. After
further replication on other patients, the authors concluded that
institution of token economy effected positive changes in this type
of depression. It should be underscored that it is only the obtained
reversal in the second A phase that permits a firm conclusion with
respect to the controlling effects of token economy on the two target
behaviors. The reversal in the second A phase confirms that changes
obtained in target behaviors were a direct function of the
institution and removal of the token economy treatment variable.
Although the A-B-A design is superior to the simple A-B design
in that it permits unequivocal conclusions on the basis of the
experimental analysis, two major problems are present. First, therapy
for the patient who is being simultaneously treated and evaluated
in this paradigm ends on the A or no-treatment phase of
study. This was the case in the Hersen et al study. One of the
patients was discharged prematurely at his own request. For the
other two, changes in medication necessitated termination of the
experiment on the A phase, but clinical treatment was then completed.
On an ethical and moral basis, it certainly behooves the
experimenter-clinician to continue some form of treatment to its
ultimate conclusion subsequent to completion of the research aspects
of the case. A further design, known as the A-B-A-B design,
meets this criticism as study ends on the B or treatment phase.
This design has the added advantage of providing two occasions
for the treatment variable to demonstrate a positive effect, thus
further strengthening conclusions as to its actions.

Fig 1. Effects of nonreinforcement and reinforcement in a case of
anorexia nervosa for subject 1. (From Leitenberg H et al; reprinted
by permission of the authors, editor, and publisher.) [Graph: calories
and weight plotted against days (blocks of five) across the
nonreinforcement and reinforcement phases.]

Fig 2. Number of points earned and mean behavioral ratings for
subject 1. (From Hersen M et al; reprinted by permission of the
authors, editor, and publisher.) [Graph: number of points earned and
mean behavioral ratings plotted against days across the baseline and
token reinforcement phases.]


A-B-A-B Design 


Miller used an A-B-A-B experimental single-case design in an
analysis of retention control training (RCT) in two “secondary 
enuretic” children. The number of enuretic episodes and mean 
frequency of daily urination were selected as target behaviors. 
During baseline (A) the patient was instructed to record target 
behaviors (Fig 3). He also received weekly treatment sessions con- 
sisting of discussion of troublesome situations that had occurred 
in the previous week. RCT (B) consisted of initially training the 
patient to refrain and postpone urination for ten minutes each 
time the urge was experienced. This was increased to 20 and 30 
minutes in the following weeks. In addition, the patient was in- 
structed to increase consumption of fluids throughout each day. 
During the second baseline (A) the patient was instructed to dis- 
continue RCT, and in the following B phase it was reinstated. An 
examination of the data indicates that stable measurements were
obtained in baseline. Application of RCT resulted in marked de- 
creases in target behaviors. Removal of RCT in the second base- 
line led to both increased enuretic episodes and increased fre- 
quency of daily urination. Original baseline levels were achieved. 
Reintroduction of RCT led to decreased frequency of urination 
and eventual absence of enuretic episodes. This treatment was
then continued until enuresis was virtually eliminated. As previ- 
ously noted, this represents one of the major advantages over the 
less complete and less complex A-B-A design. In the Miller experiment
a double reversal in the data was obtained, indicating
that the controlling effects of RCT could be replicated within the 
same patient, lending further credence to the effects of this treatment.
An important experimental strategy to consider at this
point is the length of the phases. Each phase should contain a rela- 
tively equal number of observations to insure that effects are due 
to the treatment variable. For instance, in the Miller experiment
enuresis could have worsened slightly at the very beginning of the 
B phase and then improved rapidly. Enuresis might have wors- 
ened somewhat again when the A phase was reintroduced. If the 
A phase was then stopped shortly thereafter, it could not be determined
that the return to baseline produced the reversal. Such a reversal
might have been due to some correlated event occurring at
the change of treatment. Moreover, enuresis would have improved
if the A phase had been extended to equal the previous B phase.
In all single-case experiments, phases should be as nearly equal as
possible.

Fig 3. Number of enuretic episodes per week and mean number
of daily urinations per week for subject 1. (From Miller PM; Behav
Ther, reprinted by permission of the author, editor, and publisher.)
[Graph: number of enuretic episodes and mean frequency of daily
urination plotted against consecutive weeks 1 to 14 across the
baseline, retention control training, baseline, and retention control
training phases.]

Fig 4. Total score on card sort per experimental day and total
frequency of pedophilic sexual urges in blocks of four days
surrounding each experimental day. (Lower scores indicate less sexual
arousal.) (From Barlow DH et al; copyright 1969 by the American
Psychological Association, and reproduced by permission.) [Graph:
total urges and card sort scores plotted against experimental days
across the baseline, acquisition, extinction, and reacquisition
phases.]
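
The reversal logic of the A-B-A-B design and the guideline that phases be of nearly equal length can be sketched as two simple checks. The helper names, the use of phase means as a summary, and the length ratio of 1.5 are all assumptions of this sketch; in practice the data are judged graphically, as in the figures above.

```python
from statistics import mean

def shows_reversals(phases, lower_is_better=True):
    """True if each move into B improves the phase mean and each return to A worsens it.

    phases: list of (label, observations) in chronological order, labels "A" or "B".
    """
    means = [(label, mean(observations)) for label, observations in phases]
    for (_, prev_mean), (next_label, next_mean) in zip(means, means[1:]):
        improvement = prev_mean - next_mean if lower_is_better else next_mean - prev_mean
        if next_label == "B" and improvement <= 0:
            return False
        if next_label == "A" and improvement >= 0:
            return False
    return True

def phase_lengths_balanced(phases, max_ratio=1.5):
    """True if the longest phase is at most max_ratio times the shortest (assumed cutoff)."""
    lengths = [len(observations) for _, observations in phases]
    return max(lengths) / min(lengths) <= max_ratio

# Hypothetical weekly counts of a problem behavior (lower is better).
study = [
    ("A", [5, 6, 5]),
    ("B", [3, 1, 1, 0]),
    ("A", [3, 4, 5]),
    ("B", [2, 1, 0, 0]),
]
print(shows_reversals(study))         # both reversals present
print(phase_lengths_balanced(study))  # phases of 3 or 4 observations
```

A record that failed the first check would leave the controlling effect of treatment undemonstrated; one that failed the second would invite the interpretive problem described above for a prematurely shortened A phase.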


B-A-B Design 


Some researchers have used a variant of the A-B-A-B design, 
labeled the B-A-B design, in which the initial baseline phase is 
omitted. Others include an initial but abbreviated A phase in
which only one data point is obtained. In both cases these de- 
signs are superior to the A-B-A design as study ends on the treat- 
ment or B variable. However, these designs do not allow for initial 
baseline assessment (A), and, consequently, it is not possible to 
determine changes brought about as a result of first introducing 
the B therapeutic variable. From both a research and clinical 
standpoint the use of the complete A-B-A-B design is recom- 
mended where possible for examination of the effects of singular 
therapeutic variables on behavior. 


A-BC-B-BC Design 


When the controlling effects of specific aspects of treatment 
techniques (eg, use of the noxious scene in covert sensitization) 
are to be examined, the experimental single-case design of choice 
is the A-BC-B-BC design. The A-BC-B-BC design is structurally
similar to the A-B-A-B design, but procedurally different. As in 
the A-B-A-B design, the first phase of the A-BC-B-BC design in- 
volves a baseline assessment of target behaviors. In the BC phase 
a composite treatment variable is introduced, and changes in tar- 
get behaviors, if any, are recorded and plotted graphically. In the 
B phase one aspect of the treatment variable is omitted in order to 
assess its controlling effects over target behaviors. If indeed that 
portion of the technique is critical for therapeutic success, im- 
provement in target behaviors should cease or be reversed. By 
contrast, when that portion of the treatment variable is reintro- 
duced in the BC phase, a second reversal should be obtained as in- 
dicated by renewed improvement in target behaviors. 

A clear example of the A-BC-B-BC design is presented by Barlow
et al in their assessment of the effects of the noxious scene in
covert sensitization (a form of imaginal aversion therapy often
used in treatment of sexual deviation and addictions). This was
examined in one case of pedophilia and another of homosexuality.
Two target behaviors were selected for study. One consisted of
the patient’s daily responses to a card sort containing a hierarchy
of sexually arousing scenes. The number of daily urges toward
“immature girls” was the second target behavior for the pedophilic
patient. During baseline (A) operant rates of these target
behaviors were recorded. Examination of Fig 4 shows an upward
trend in both total urges and card sort scores during baseline.
Covert sensitization procedures were then applied in acquisition
(BC). Treatment involved daily imaginal presentation of deviant
sexually arousing scenes paired with verbal descriptions of nausea
and vomiting by the therapist. Examination of the figure indicates
a dramatic linear decrease of urges and card sort scores during
acquisition (BC). The specific or controlling effects of the
noxious scene were then examined in extinction (B) for both total
urges and card sort. Reintroduction of the noxious scene in
reacquisition (BC) led to a renewed decrease of total urges and card
sort scores, thus illustrating the direct controlling effects of
pairing the noxious scene with the sexually arousing scene in covert
sensitization.

It might be noted that a relatively equal number of data points
are present in each experimental phase of the Barlow et al study.
Secondly, it should be underscored that after introduction of
treatment only one variable at a time was altered from one phase
to the next (eg, elimination of the noxious scene in extinction
while maintaining all other procedures as constants; reintroducing
the noxious scene in reacquisition). If, on the other hand, two
variables were to be manipulated simultaneously, it would not be
possible to ascertain which of the two accounted for changes in the
target behavior. Changing one variable at a time across conditions
is an important guide rule in carrying out all types of experimental
single-case research. Moreover, this rule is of particular
importance when several therapeutic variables are simultaneously
present.

Fig 5. Time in which a knife was kept exposed by a phobic patient
as a function of feedback (FB), feedback plus praise, and no
feedback or praise conditions. (Fig 2, p 136, from Leitenberg H et
al; copyright 1968 by Society for the Experimental Analysis of
Behavior, Inc., and reproduced by permission.) [Graph: exposure time
plotted against blocks of four sessions (40 trials) across seven
phases.]
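
The guide rule of changing one variable at a time can be checked mechanically from a design label. In the sketch below each letter other than A is assumed to stand for one treatment component and A for baseline with no component present; the function name and the option that exempts the first introduction of a composite treatment (as in A-BC-B-BC) are assumptions made for illustration.

```python
def multi_variable_transitions(design, allow_initial_package=True):
    """Return (phase_index, from_label, to_label) for transitions that alter
    more than one treatment component at once.

    design: a label such as "A-BC-B-BC".
    """
    labels = design.split("-")
    # "A" is baseline (no components); any other label is a set of component letters.
    phases = [set() if label == "A" else set(label) for label in labels]
    flagged = []
    for i in range(1, len(phases)):
        introduces_package = i == 1 and not phases[0]
        if allow_initial_package and introduces_package:
            continue  # first introduction of treatment after baseline
        changed = phases[i - 1] ^ phases[i]  # components added or removed
        if len(changed) > 1:
            flagged.append((i, labels[i - 1], labels[i]))
    return flagged

for design in ["A-BC-B-BC", "B-BC-B-A-B-BC-B", "A-B-C"]:
    print(design, multi_variable_transitions(design))
```

A design such as A-B-C is flagged at its last transition because moving from B to C removes one component and adds another in the same step, so any change in the target behavior could not be attributed to either alone.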


Combined Designs 


Combined designs have been used by a number of researchers
during the course of their attempts to assess additive effects of
particular psychotherapeutic variables on target behaviors.
More specifically, if one therapeutic variable such as feedback is
shown to effect changes in a target behavior, an experimental
question might be raised as to the additional effects of a second
therapeutic variable (eg, reinforcement) on that same behavior.
The additive effects of the aforementioned two variables were
examined by Leitenberg et al in their experimental treatment of a
knife-phobic patient. The target behavior selected for study was
the amount of time (in seconds) that the patient was able to
remain in the presence of the phobic object behind a closed door. The
type of design used for this experimental analysis was a
B-BC-B-A-B-BC-B design. B represented the feedback variable, BC a
combination of feedback and praise, while A was baseline. Feedback
consisted of informing the patient after each trial as to amount of
time spent in all ten trials. Praise consisted of verbal
reinforcement whenever the patient exceeded a progressively
increasing time criterion. The results of this study are presented
in Fig 5. During feedback (B) a marked upward linear trend was noted
in the data. The addition of praise in phase 2 (BC) did not change
the upward trend. Removal of praise in phase 3 (B) did not yield a
change in the slope of the curve, suggesting that feedback alone
was the critical variable controlling change. In phase 4 (A) a short
reversal was obtained, confirming the controlling effects of the
feedback alone variable. Reintroduction of feedback in phase 5 (B)
led to improvement. The addition of praise in phase 6 (BC) resulted
in a continued upward trend. However, removal of praise in phase 7
(B) once again failed to bring about a change in the slope of the
curve. In short, changes in the target behavior were a function of
feedback alone. Praise did not produce an additive effect inasmuch
as its controlling effects were not demonstrated in the experimental
analysis. For purposes of comparison and illustration, data from the
Leitenberg et al study are replotted to demonstrate the shape of the
graph had the praise variable exerted a controlling influence on the
target behavior. It will be noted in our hypothetical data in Fig 6
that a slight upward trend was obtained in phase 1 (B). The addition
of praise in phase 2 (BC) resulted in a steep increase and a change
in the slope of the curve. Removal of praise in phase 3 (B) led to a
continued but less marked increase. Removal of feedback and praise
in phase 4 (A) resulted in a reversal of the data while reinstatement
of feedback in phase 5 (B) effected a slight upward trend. Addition
of praise in phase 6 (BC) once again resulted in a steep upward
trend and change in the slope of the curve. Removal of praise in
phase 7 (B) resulted in a continued but slightly upward trend. The
hypothetical data suggest that both therapeutic variables were
effective. Feedback led to slight changes but marked changes in the
slope of the curve were noted when praise was added, illustrating
both additive and controlling effects of that variable.

Fig 6. Time in which a knife was kept exposed by a phobic patient
as a function of feedback, feedback plus praise, and no feedback or
praise conditions. (Hypothetical data based on Fig 2, p 136, from
Leitenberg H et al.) [Graph: exposure time plotted against blocks of
four sessions (40 trials) across phases 1 to 7, labeled B, BC, B, A,
B, BC, B.]

In some cases, as many as three and four variables have been
examined sequentially and in combination. Many variants of the
combined design are possible; examples include A-B-BC-B-BC and
A-B-BC-BCD-BC-BCD.


Multiple Baseline Designs


In all of the experimental analysis designs presented to this
point, the reversal technique (removal of the treatment variable
or a relevant portion of it) has been used to demonstrate the
controlling effects of the therapeutic variable under study. There are
times, however, when the reversal technique may not be feasible,
particularly when treatment considerations argue against its
application. An alternative method to demonstrate the controlling
effects of therapeutic variables in single subjects is known as the
“multiple baseline” technique. In this method of study specific
but independent target behaviors are identified and precisely
defined. A baseline measurement of each target behavior is
established, following which a particular therapeutic technique (eg,
electrical aversion) is applied to the first of the target behaviors.
If the technique is successful and the selected target behaviors
are truly independent of one another, changes in the first target
behavior should appear while little or no change is noted in the
others. Subsequently, the technique is applied to a second target
behavior, and changes are again noted. However, such changes
should not be found in the remaining untreated behaviors. Baer et
al argue that: “The experimenter is attempting to show that he
has a reliable experimental variable, in that each behavior
changes maximally only when the experimental variable is
applied to it.” In application, the “multiple baseline” design is
completed when the therapeutic variable has been administered to
each of the designated target responses. There are no specific
rules as to how many target behaviors are needed to establish
control and specificity of treatment, but the controlling effects of that
technique over at least three target behaviors would appear to be
a minimum requirement.

Fig 7. Specificity of autonomic changes (patient B). (From Marks
IM and Gelder MG; reprinted by permission of the authors and
editor.) [Two panels plotted against sessions for each stimulus:
erection latency at end of each aversion session, and erections
after one-minute exposure to stimulus.]
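
The multiple baseline logic described above (each behavior should change when, and only when, the technique is applied to it) can be sketched as follows. The function name, the comparison of phase means, and the 50% cutoff are assumptions of this sketch rather than criteria proposed in the text, and the series are hypothetical.

```python
from statistics import mean

def multiple_baseline_report(series, onsets, min_change=0.5):
    """Summarize, for each behavior, its drift while untreated and its change once treated.

    series: behavior -> list of per-session measurements (all positive).
    onsets: behavior -> index of the first session in which it is treated.
    """
    first_onset = min(onsets.values())
    report = {}
    for behavior, values in series.items():
        onset = onsets[behavior]
        initial = mean(values[:first_onset])   # before any behavior is treated
        waiting = values[first_onset:onset]    # untreated while others are treated
        before = mean(values[:onset])
        after = mean(values[onset:])
        drift = abs(mean(waiting) - initial) / initial if waiting else 0.0
        change = abs(after - before) / before
        report[behavior] = {
            "drift": drift,
            "change": change,
            "specific": drift < min_change <= change,
        }
    return report

# Hypothetical response strengths for three independent target behaviors.
series = {
    "behavior_1": [10, 10, 10, 2, 2, 2, 2, 2, 2],
    "behavior_2": [8, 8, 8, 8, 8, 1, 1, 1, 1],
    "behavior_3": [9, 9, 9, 9, 9, 9, 9, 3, 3],
}
onsets = {"behavior_1": 3, "behavior_2": 5, "behavior_3": 7}

for behavior, result in multiple_baseline_report(series, onsets).items():
    print(behavior, result["specific"])
```

If an untreated behavior drifted substantially while another was being treated, the report would mark it as not specific, which corresponds to the case where a general effect or a correlated variable such as expectancy cannot be ruled out.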

The “multiple baseline” design was used by Marks and Gelder
in their assessment of electrical aversion therapy in treating sex- 
ual deviation. One of their study patients was a young male trans- 
vestite. Baseline assessment indicated that sexual arousal (mea- 
sured via a penile transducer) was maximal when he either 
observed or touched one of several stimuli (panties, slip, skirt, 
woman’s pajamas) for a period of one minute. All of these stimuli 
had previously been used in his cross-dressing episodes. Similarly, 
the patient responded maximally to a photograph of a nude fe- 
male. Following baseline assessment, a course of electrical aver- 
sion consisting of about 20 trials was administered to the patient 
in relation to each of the target stimuli (panties, slip, skirt, 
woman’s pajamas) in sequence. Erection latency to each target 
stimulus following aversion sessions is presented at the top of Fig 
7. Strength of penile erection after a one-minute exposure to tar- 
get stimuli is presented at the bottom of Fig 7. It will be noted 
that erectile strength (first block of five stimuli) prior to aversion 


Single-Case Designs/Barlow & Hersen 323 


Downloaded From: http://archpsyc.jamanetwork.com/ by a DALHOUSIE UNIVERSITY-DAL-11762 User on 06/20/2015 





No Drug Placebo Trifluoperazine Placebo Trifluoperazine 
g 14- 
eC 
5 12 
3 
nS 10 
S 8 
6 
@ 6 
Oo 4 Ne 
2 
zo a 
a] 
a 
= 





7 9 11°13 15 17 19 21 23 25 
Sessions 
Fig 8.—Average number of refusals to engage in a brief conversa: 
tion during 18 random time samples per day by patient. Tri- 
fluoperazine (Stelazine) dose was 60 mg daily. Each session repre- 
sents the average of a two-day block of observation. (From 
Liberman RP, et al: Research design for analysing drug-environ- 
ment-behavior interactions. J Nerv Ment Dis, 7° reprinted by per- 
mission of the authors and the Williams & Wilkins Co, Baltimore.) 


was maximal, with the exception of the “slip” stimulus. The sec- 
ond block of five stimuli shows erectile strength following electri- 
cal aversion in connection with the panties stimulus. Erectile 
strength in response to panties was extremely low, but it was at 
maximum strength for the other four stimuli. The third block of
five stimuli depicts erectile strength following sequential aversion
with respect to panties, skirt, and woman’s pajamas. Examination 
of that block reveals maximum erectile strength to the nude photo 
and marked erectile strength to the untreated “slip” stimulus. 
Aversion was then applied in relation to the “slip” stimulus, with 
a resulting decrease in erectile strength (fourth block). Erectile 
strength towards the nude photo remained at its maximum, indi- 
cating the specificity of aversion treatment to these stimuli. In 
this study erectile responses to the nude photo may be conceptua- 
lized as a control for the other four stimuli. If erections to un- 
treated articles of clothing and the nude photo had decreased, 
then one would either conclude that electrical aversion had some 
general effect on sexual arousal or that a correlated therapeutic 
variable (eg, expectancy) was the crucial therapeutic agent. In 
summary, Marks and Gelder²² demonstrated, both sequentially
and differentially, the controlling effects of electrical aversion over 
erectile responses to five target stimuli selected for study. 
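
The logic of this sequential, stimulus-by-stimulus demonstration can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the scores, the 50% cutoff, and the function names are hypothetical and are not taken from Marks and Gelder. The sketch checks that the response to each treated stimulus stays high until aversion is applied to that stimulus and is low afterwards, while the untreated control (the nude photo) stays high throughout.

```python
from typing import Dict, List

# Hypothetical erectile-strength scores (percent of full erection), one value
# per assessment block. Block 0 is the pre-aversion baseline. These numbers are
# invented for illustration and are NOT the data of the original study.
SCORES: Dict[str, List[float]] = {
    "panties": [100, 10, 5, 5],
    "skirt": [100, 100, 10, 5],
    "pajamas": [100, 100, 15, 10],
    "slip": [70, 100, 80, 10],
    "nude photo": [100, 100, 100, 100],  # never treated: serves as the control
}

# Index of the first block assessed after aversion was applied to each stimulus.
FIRST_TREATED_BLOCK: Dict[str, int] = {
    "panties": 1,
    "skirt": 2,
    "pajamas": 2,
    "slip": 3,
}


def drop_follows_treatment(series: List[float], first_treated_block: int,
                           cutoff: float = 50.0) -> bool:
    """True if scores stay at or above the (assumed) cutoff before treatment
    and fall below it in every block after treatment."""
    before = series[:first_treated_block]
    after = series[first_treated_block:]
    return all(s >= cutoff for s in before) and all(s < cutoff for s in after)


def control_unchanged(series: List[float], cutoff: float = 50.0) -> bool:
    """True if the untreated control stays at or above the cutoff throughout."""
    return all(s >= cutoff for s in series)


def summarize(scores: Dict[str, List[float]],
              first_treated_block: Dict[str, int]) -> Dict[str, bool]:
    """Check each stimulus against the pattern expected under specific control."""
    result = {}
    for stimulus, series in scores.items():
        if stimulus in first_treated_block:
            result[stimulus] = drop_follows_treatment(
                series, first_treated_block[stimulus])
        else:
            result[stimulus] = control_unchanged(series)
    return result


if __name__ == "__main__":
    for stimulus, ok in summarize(SCORES, FIRST_TREATED_BLOCK).items():
        verdict = "consistent" if ok else "inconsistent"
        print(f"{stimulus}: {verdict} with stimulus-specific control")
```

If the control series had also fallen, the check would fail for the nude photo, which corresponds to the alternative interpretations (a general effect on arousal, or a correlated variable such as expectancy) mentioned above.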


Side Effects


From the foregoing description of single-case research, it be- 
comes apparent that a wide variety of problem areas have been 
studied. However, there are still other features and applications 
of these designs that have not been discussed. For example, some 
investigators²³⁻²⁷ have not only examined the controlling effects of
particular variables on designated target behaviors, but have also 
measured and assessed the “side effects” of these same variables 
on other ongoing nonmanipulated behaviors. Sajwaj et al²⁵ point
out that such covariations in nonmanipulated behaviors may pos- 
sibly result in both socially desirable and undesirable side effects. 
This is of particular importance in the case of undesirable side ef- 
fects, as additional application of techniques will then be required 
to exert appropriate controls over behaviors. 


Effects of Drugs 


The experimental single-case design is also well suited for ex- 
amination of the effects of pharmacological agents on behavior.* 
Using experimental analysis designs under double-blind condi- 
tions, one might sequentially administer a placebo, drug X, a pla- 
cebo, and once again drug X (A-B-A-B) while observing concomi- 


324 Arch Gen Psychiatry/Vol 29, Sept 1973 


tant behavioral changes. One might also examine two drugs in
sequence (placebo, drug X, placebo, drug Y, placebo, drug X,
placebo, drug Y), and in other instances the additive effects of drugs
(placebo, drug X, drug X and drug Y, drug X, drug X and drug Y).
Since continued measurements are in effect, length of phases can
be varied from experiment to experiment to determine precisely
the latency of drug effects after beginning the dosage and the
residual effects after discontinuing the dosage.
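
As a rough illustration, the three phase sequences just described, and the idea of varying phase length, can be written out as a session-by-session schedule. The helper names (`alternate`, `expand`) and the phase lengths used in the usage note are hypothetical choices for this sketch, not a procedure specified in the studies cited.

```python
from typing import List, Sequence, Tuple


def alternate(baseline: str, treatments: Sequence[str], cycles: int) -> List[str]:
    """Build a phase sequence in which every treatment phase is preceded by
    the baseline condition, repeated for the given number of cycles."""
    phases: List[str] = []
    for _ in range(cycles):
        for treatment in treatments:
            phases.extend([baseline, treatment])
    return phases


def expand(phases: Sequence[str], lengths: Sequence[int]) -> List[Tuple[int, str]]:
    """Turn a phase sequence into (session number, condition) pairs.
    Phase lengths are free parameters, so latency and residual drug effects
    can be probed by lengthening or shortening individual phases."""
    if len(phases) != len(lengths):
        raise ValueError("one length is required per phase")
    schedule: List[Tuple[int, str]] = []
    session = 1
    for phase, length in zip(phases, lengths):
        for _ in range(length):
            schedule.append((session, phase))
            session += 1
    return schedule


# A-B-A-B: placebo, drug X, placebo, drug X
ABAB = alternate("placebo", ["drug X"], cycles=2)

# Two drugs in sequence: placebo, X, placebo, Y, placebo, X, placebo, Y
TWO_DRUGS = alternate("placebo", ["drug X", "drug Y"], cycles=2)

# Additive effects: the combination phases are listed explicitly
ADDITIVE = ["placebo", "drug X", "drug X + drug Y", "drug X", "drug X + drug Y"]
```

For example, `expand(ABAB, [2, 3, 2, 3])` yields a ten-session schedule; lengthening the placebo phases in a later experiment would give more sessions in which to observe residual effects after the drug is discontinued.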

An example of the effects of a drug on a selected target behavior
in a within-subject reversal design is presented by Liberman
et al.²⁸ The effects of trifluoperazine (Stelazine) were assessed in
an A-A1-B-A1-B design in a 21-year-old withdrawn male
schizophrenic. The target behavior chosen for study was the patient’s
willingness to engage in five-minute chats initiated by a member
of the nursing staff (blind to conditions) 18 times daily at
randomly selected times. On day 1 the patient was withdrawn from
his trifluoperazine medication and an examination of Fig 8
indicates that during the A phase (no drug) the number of asocial
responses (unwillingness to chat) increased sharply. Institution of
the placebo in the A1 phase (placebo) resulted in an initial decrease
followed by a marked linear increase in asocial responses. In
phase B (trifluoperazine) a dosage of 60 mg/day was introduced.
A marked decrease in asocial responses resulted. When placebo
was reintroduced in the A1 phase (placebo), a reversal was noted
as indicated by the marked increase in asocial responses.
Reintroduction of trifluoperazine (60 mg/day) in the second B phase led
to a second reversal (decrease in asocial responses), thus suggesting
the controlling effects of the drug. The conventional double-blind
design used in group drug studies was not quite applicable here.
Although the patient was unaware of placebo conditions, the
experimenter (physician) was obviously aware of the drug being
administered. However, the assessor of the target behavior (eg, the
nurse) was blind to the condition in force. In that sense the spirit
of the double-blind design is approximated and maintained.
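
The reasoning behind reading such a record as a pair of reversals can be sketched as follows. The session values below are hypothetical and are not read off Fig 8; the function names are likewise illustrative. The sketch computes the mean number of asocial responses per phase and then the direction of change between consecutive phases; a drug effect of the kind described would show up as "down" on each entry into phase B and "up" on each return to placebo.

```python
from statistics import mean
from typing import List, Tuple

# Hypothetical asocial responses per session, grouped by phase in the order
# A, A1, B, A1, B. Invented for illustration; NOT the data shown in Fig 8.
PHASES: List[Tuple[str, List[float]]] = [
    ("A (no drug)", [4, 8, 11]),
    ("A1 (placebo)", [9, 10, 12, 13]),
    ("B (trifluoperazine)", [8, 5, 3, 2]),
    ("A1 (placebo)", [5, 8, 11]),
    ("B (trifluoperazine)", [7, 4, 2]),
]


def phase_means(phases: List[Tuple[str, List[float]]]) -> List[Tuple[str, float]]:
    """Mean of the target behavior within each phase, in phase order."""
    return [(label, mean(values)) for label, values in phases]


def directions(means: List[Tuple[str, float]]) -> List[str]:
    """Direction of change in the phase mean between consecutive phases."""
    changes: List[str] = []
    for (_, previous), (_, current) in zip(means, means[1:]):
        if current > previous:
            changes.append("up")
        elif current < previous:
            changes.append("down")
        else:
            changes.append("flat")
    return changes


if __name__ == "__main__":
    means = phase_means(PHASES)
    for label, value in means:
        print(f"{label}: {value:.2f}")
    print(directions(means))
```

Phase means are a deliberately crude summary. They ignore trends within a phase, such as the initial decrease followed by a linear increase reported for the first placebo phase, so in practice the session-by-session plot remains the primary evidence.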


Comment 


Obviously the single-case design cannot answer all clinical
research questions. A first limitation occurs if one
wishes to compare two global treatments, both of which
are effective on a given behavior disorder. Determining
which treatment is more effective would be difficult
using single-case methodology. In this case a group
design, where each treatment is administered to a separate
set of patients and where results are analyzed
statistically, would be more appropriate.

One must question, however, the usefulness of determining
which treatment is statistically better when differences,
unless extraordinarily large by statistical standards,
are clinically unimportant. This issue has been
raised by Bergin and Strupp¹ who, in a thorough review of
psychotherapy research, concluded that any effort designed
to evaluate global treatments such as “psychotherapy”
is likely to produce “weak” (clinically insignificant)
results. A more productive approach at this stage in the
development of behavioral change techniques would be to
determine through single-case designs those active
ingredients that are effective in both treatments. These
ingredients could then be combined into a more effective
composite therapy.

A second limitation of single-case designs arises if one
wishes to test variables that are irreversible or partially
irreversible (eg, the effects of surgical lesions or certain
types of therapeutic instructions). Here the effect of the
therapeutic variable continues in subsequent phases,
producing what is termed “carry over effects” or “sequential
confounding of results.” These therapeutic variables may,
at times, be tested in multiple baseline designs where several
target behaviors are independently measured, but
often a group comparison is the only alternative. (Note,
however, in some instances therapeutic instructions can
be tested in single-case designs.²⁹)

A similar difficulty ensues if one is testing a therapeutic
variable with long-lasting effects such as some pharmacologic
agents. Here one must balance the disadvantages
of alternating the drug and a placebo in phases each lasting
several months with the previously mentioned disadvantages
of the group comparison.

Finally, one of the most important aspects of research is
the generality of the conclusions. One of the assumed
strengths of group designs is that results are applicable to
all patients carrying the same diagnosis. The fallacies of
this assumption were discussed in the introduction where
it was noted that a few people may improve a great deal
while others deteriorate somewhat, producing an overall
statistical improvement but losing important individual
differences in the group average. Although single-case designs
provide clinicians with information on effects of
treatment for patients with the same characteristics, 
their results also may not be applicable to all patients with 
a similar behavior disorder. The answer to the problem is 
to apply systematically the therapeutic variable to cases 
with different background behaviors or “personality” variables,
an approach which Sidman³ refers to as “systematic
replication.” Failures with a systematic replication series 
can then be ascribed to specific patient characteristics. Ex- 
perimental analyses should then be performed to deter- 
mine necessary alterations in therapeutic procedures. This 
strategy highlights individual differences rather than 
averaging them out. 
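
The point about group averages can be made concrete with a small arithmetic sketch. The ten change scores below are hypothetical (positive values mean improvement) and the function name is illustrative. Three patients improve a great deal and seven deteriorate somewhat, yet the group mean is positive.

```python
from statistics import mean
from typing import Dict, List

# Hypothetical change scores for ten patients given the same treatment.
# Positive = improvement, negative = deterioration. Invented for illustration.
CHANGES: List[float] = [20, 20, 20, -2, -2, -2, -2, -2, -2, -2]


def summarize_changes(changes: List[float]) -> Dict[str, float]:
    """Group mean alongside counts of patients who improved or deteriorated."""
    return {
        "mean_change": mean(changes),
        "improved": sum(1 for c in changes if c > 0),
        "deteriorated": sum(1 for c in changes if c < 0),
    }


if __name__ == "__main__":
    print(summarize_changes(CHANGES))
```

Here the mean change is (3 × 20 − 7 × 2) / 10 = 4.6, an apparent overall improvement, although seven of the ten patients became worse. A systematic replication series would instead report each case separately, so that the seven failures could be related to specific patient characteristics.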

This present review is by no means an exhaustive ac- 
count of single-case experimental designs or strategy. 
However, suitability of this approach to clinical research 
should lead to many variations of these designs as we 
strive to answer complex questions concerning the treat- 
ment of human behavior disorders. 


Preparation of this manuscript was supported in part by grant
MH-20258 from the National Institute of Mental Health and by Veterans
Administration grant 5-71 (Jackson Veterans Administration Center, Jack- 
son, Miss). 


References 


1. Bergin AE, Strupp HH: Changing Frontiers in the Science of
Psychotherapy. Chicago, Aldine-Atherton, 1972.

2. Shapiro MB: The single case in fundamental clinical psychological
research. Br J Med Psychol 34:255-262, 1961.

3. Sidman M: Tactics of Scientific Research. New York, Basic
Books Inc Publishers, 1960.

4. Dukes WF: N=1. Psychol Bull 64:74-79, 1965.

5. Chassan JB: Research Design in Clinical Psychology and
Psychiatry. New York, Appleton-Century-Crofts, 1967.

6. Baer DM, Wolf MM, Risley TR: Some current dimensions of
applied behavior analysis. J Appl Behav Anal 1:91-97, 1968.

7. Davidson PO, Costello CG (eds): N=1: Experimental Studies
of Single Cases. New York, Van Nostrand-Reinhold Co, 1969.

8. Bergin AE: Some implications of psychotherapy research for
therapeutic practice. Int J Psychiatry 3:136-150, 1967.

9. Gentile JR, Roden AH, Klein RD: An analysis-of-variance
model for the intrasubject replication design. J Appl Behav Anal
5:193-198, 1972.

10. Leitenberg H, Agras WS, Thomson L: A sequential analysis
of the effect of selective positive reinforcement in modifying
anorexia nervosa. Behav Res Ther 6:211-218, 1968.

11. Hersen M, et al: Effects of token economy on neurotic
depression: An experimental analysis. Behav Ther, to be published.

12. Miller PM: An experimental analysis of retention control
training in the treatment of nocturnal enuresis in two
institutionalized adolescents. Behav Ther, to be published.

13. Ayllon T, Azrin NH: The measurement and reinforcement
of behavior of psychotics. J Exp Anal Behav 8:357-388, 1965.

14. Leitenberg H, et al: Feedback in behavior modification: An
experimental analysis in two phobic cases. J Appl Behav Anal
1:131-137, 1968.

15. Rickard HC, Saunders TR: Control of “clean-up” behavior
in a summer camp. Behav Ther 2:340-344, 1971.

16. Agras WS, Leitenberg H, Barlow DH: Social reinforcement
in the modification of agoraphobia. Arch Gen Psychiatry 19:423-427,
1968.

17. Whitman TL, Zakaras M, Chardos S: Effects of reinforcement
and guidance procedures on instruction-following behavior
of severely retarded children. J Appl Behav Anal 4:283-290, 1971.

18. Barlow DH, Leitenberg H, Agras WS: Experimental control
of sexual deviation through manipulation of the noxious scene in
covert sensitization. J Abnorm Psychol 5:596-601, 1969.

19. Agras WS, et al: Instructions and reinforcement in the
modification of neurotic behavior. Am J Psychiatry 125:1435-1439,
1969.

20. Elkin TE, et al: Modification of caloric intake in anorexia
nervosa: An experimental analysis. Psychol Rep 32:75-78, 1973.

21. Hersen M, et al: Instructions and reinforcement in the
modification of a conversion reaction. Psychol Rep 31:719-722, 1972.

22. Marks IM, Gelder MG: Transvestism and fetishism: Clinical
and psychological changes during faradic aversion. Br J Psychiatry
113:711-729, 1967.

23. Lovaas OI, Simmons JQ: Manipulation of self-destruction in
three retarded children. J Appl Behav Anal 2:143-158, 1969.

24. Risley TR: The effects and side-effects of punishing the
autistic behavior of a deviant child. J Appl Behav Anal 1:21-34, 1968.

25. Sajwaj T, Twardosz S, Burke M: Side effects of extinction
procedures in a remedial preschool. J Appl Behav Anal 5:163-175,
1972.

26. Twardosz S, Sajwaj T: Multiple effects of a procedure to
increase sitting in a hyperactive retarded boy. J Appl Behav Anal
5:73-78, 1972.

27. Wahler RG, et al: The modification of childhood stuttering:
Some response-response relationships. J Exp Child Psychol
9:411-428, 1970.

28. Liberman RP, et al: Research design for analyzing
drug-environment-behavior interactions. J Nerv Ment Dis, to be
published.

29. Barlow DH, et al: The contribution of therapeutic instruction
to covert sensitization. Behav Res Ther 10:411-415, 1972.




